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PRODUCTION OF HETEROLOGOUS 
POLYPEPTIDES FROM FRESHWATER CAULOBACTER 



» «» 4 W Vfl LUTVIUIVIi 



This invention relates to the use of the Caulobacter surface layer protein 
(S-layer protein) transport system for the expression and secretion of heterologous 

polypeptides from a host organism. 



1 0 Background of the Inve ntion 



Many genera of bacteria assemble layers composed of repetitive, regularly 
aligned, proteinaceous sub-units on. the outer surface of the cell. These layers are 
essentially two-dimensional paracrystalline arrays, and being the outer molecular layer 

15 of the organism, directly interface with the environment-. Such layers are commonly 
known as S-layers and are found on members of every taxonomic group of walled 
bacteria including: Archaebacteria; Chlamydia : Cvanobacteria : Acinetobacter . 
Bacillus; Aquaspirillum ; Caulobacter; Clostridium : Chromathim . Typically, an' 
S-layer will be composed of an intricate, geometric array of at least one major protein 

20 having a repetitive regular structure. In many cases/such as in Caulobacter . the 
S-layer protein is synthesized by the cell in large quantities and the S-layer completely 
envelopes the cell and thus appears to be a protective layer. 

Caulobacter are natural inhabitants of most soil and freshwater environments 
2 5 and may persist in waste water treatment systems and effluents. The bacteria alternate 
between a stalked celt that is attached to a surface, and an adhesive motile dispersal 
cell that searches to find a new surface upon which to stick and convert to a stalked 
cell, The bacteria attach tenaciously to nearly ali surfaces and do so without 
producing the extracellular enzymes or polysaccharide "slimes- that are characterisuc 
30 of most other surface attached bacteria. They have simple requirements for growth. 
The organism is ubiquitous in the environment and has been isolated from 
oligotrophic to mesotrophic situations. Caulobacters are known for their ability to 
tolerate low nutrient level stresses, for example, low phosphate levels. This nutrient 
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can be limiting in many leachate waste streams, especially those with high levels of 

iron or calcium. 

m:- . 

Freshwater Caulobacter producing S-layers may be readily detected by 
5 negative stain transmission electron microscopy techniques. Caulobacter may be 
isolated using the methods outlined by MacRae, J.D. and Smit (1991) Applied and 
Environmental Microbiology 57:751-758. which take advantage of the fact that 
Caulobacter an to!erate P eriods of nation while other soil and water bacteria may 
not and that they all produce a distinctive stalk structure, visible by light microscopy 

10 (using either phase contrast or standard dye staining methods). Once Caulobacter 
strains are isolated in atypical procedure, colonies are suspended in 2% ammonium 
molybdate negative stain and applied to plasUc-filmed, carbon-stabilized 300 or 
400 mesh.copper or nickel grids and examined in a transmission electron microscope 
at 60 kilovolt accelerating voltage, as described in Smit, J. (1986) "Protein Surface 

15 Layers of Bacteria", in Outer Membran es as Model M . mougCi 

Ed. J. Wiley & Sons, at page 343-376. S-layers are seen a two-dimensionai 
geometric patterns most readily on those cells in a colony that have lysed and released 
their internal contents. 

2 0 The S-layer of different freshwater Caulobacter is hexagonally arranged with a 

similar centre-centre dimension and anusera raised against the S-layer protein of 
C. crescenms strain CB15 reacts with S-layer proteins from other Caulobacter 
(see: Walker, S.G., et_aj. (1992) (J. Bacteriol. 174:1783-1792). All S-layer proteins 
isolated from Caulobacter may be substantially purified using the same extraction 
25 method (pH extraction). All strains appear to have a ^polysaccharide (LPS) 
reactive with antisera against the CB15 strain lipopolysaccharide species. The LPS 
appears to be required for S-layer attachment. 

The S-layer elaborated by freshwater isolates of Caulobacter are visibly 

3 0 indistinguishable from the S-layer produced by Caulobacter crescenms strains CB2 

and CB15. The S-layer proteins from the latter strains have approximately 
100,000 m.w. although sizes of S-layer proteins from other species and strains will 
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vary. The protein has been characterized both structurally and chemically. It is 
composed of ring-like structures spaced at 22nm intervals arranged in a hexagonal 

manner on the outer membrane. The S-Iayer is bound to the bacterial surface »nri mn- 

. . — — j 

be removed by low pH treatment or by treatment with a calcium chelator such as 
EDTA. 

The similarity of S-layer proteins in different strains of Caulobacter permits 
the use of a cloned S-layer protein gene of one Caulobacter strain for retrieval of the 
corresponding gene in other Caulobacter strains (see: Walker, S.G. et al . (1992) 
[ supra ]; and, MacRae, J.D. et al . (1991) (supra]. 

Expression, secretion and optionally, presentation of a heterologous 
polypeptide in Caulobacter provides advantages not previously seen in systems using 
organisms such as E. coli and Salmonella in which fusion products using different 
surface proteins have been reported. All known Caulobacter strains axe believed to be 
harmless and are nearly ubiquitous in aquatic environments. In contrast, many 
Salmonella and E. coli strains are pathogens. Consequently, expression and secretion 
of a heterologous polypeptide using Caulobacter as a vehicle will have the advantage 
that the expression system will be stable in a variety of outdoor environments and may 
not present problems associated with the use of a pathogenic organism. Furthermore. 
Caulobacter are natural biofilm forming species and may be adapted for use in fixed 
biofilm bioreactors. The quantity of S-layer protein that is synthesized and is secreted 
by Caulobacter is high, reaching 12% of the cell protein. The unique characteristics 
of the repetitive, rwo-dimensional S-layer would also make such bacteria ideal for use 
as an expression system, or as a presentation surface for heterologous polypeptides. 
This is desirable in a live vaccine to maximize presentation of the antigen or antigenic 
epitope. In addition, use of such a presentation surface to achieve maximal exposure 
of a desired polypeptide to the environment results in such bacteria being particularly 
suited for use in bioreactors or as carriers for the polypeptide in aqueous or terrestrial 
outdoor environments. 



The invention described in the PCT application published September 18, 1997 
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under WO 97/34000 describes the C-terminal region of Caulobacter crescentus 
S-layer protein as being essential for secretion of S-layer protein in that species. 
Heterologous polypeptides may be conveniently expressed and secreted by a host 
Caulobacter wncn Polypeptide is expressed as a fusion with the C-terminal 
5 secretion signal. Further studies with C. crescentus have demonstrated that the 
species employs a type I secretion system which involves an uncleaved C-terminal 
secretion signal on the surface layer protein (RsaA) and several transport proteins 
encoded by genes 3* to the surface layer protein gene (rsaA) (Ajnram, P. and Smit, J. 
(1998) Journal of Bacteriology 180:3062-3069). 

10 

A ^'P'" 1 W 1 secretion system uses three transport protein components. 
One such component, the ABC transporter, is embedded in the inner membrane, 
contains an ATP-binding region, recognizes the C-terminal secretion signal of the 
substrate protein, and hydrolyzes ATP during the transport process. Another 

15 component, the membrane fusion protein (MFP) is anchored in the inner membrane 
and appears to span the periplasm. The remaining component is an outer membrane 
protein (OMP) that is thought to interact with the MFP to form a channel that extends 
from the cytoplasm through the two membranes to the outside of the cell. In 
C. CTescentus, the ABC transporter and the MFP proteins have been termed RsaD and 

20 RsaE (respectively) and their genes are immediately 3* of rsaA. Further downstream 
is the rsaF gene which is believed to encode the OMP. 

It is desirable to provide for the use of Caulobacter species other than 
C. crescentus in the expression and secretion of heterologous polypeptides from a host 
25 organism. 

Summary of Invention 

This invention is based on the discovery that S-layer producing freshwater 
3 0 Caulobacter (other than C. crescentus ) rely on a type I secretion signal located at the 
C-terminus of the S-layer protein and highly conserved transport proteins. While the 
secretion signal itself is not as well conserved as the transport proteins, a secretion 
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signal from a first species of Caulobacter will be recognized by the transport 
mechanism of other species. Thus, a surface layer protein secretion signal derived 
from any freshwater S-layer producing Caulobacter may be used in the invention 
described in WO 97/34000. As well, any Caulobacter which contains a type I 
5 secretion system may be used as a host organism for the expression and secretion of 
heterologous polypeptides to which a Caulobacter S-laver protein secretion signal has 
been fused. Nucleic acid constructs made for expression of heterologous polypeptides 
may include a surface layer protein secretion signal from a Caulobacter other than 
C. crescenrus , for expression in the same species from which the surface layer protein 
1 0 signal was derived or for expression in a different species. Furthermore, a C-termiaal 
secretion signal derived from the S-layer protein (RsaA) of C. crescenru s. may be 
used in such transformation of Caulobacter other than C. crescenrus . 

This invention also provides the use of Caulobacter other than C. crescenrus as 
15 a host organism for the expression of polypeptides heterologous to a surface layer 
protein of the Caulobacter, wherein the Caulobacter has at least one surface layer 
transport protein that is homologous to RsaD or RsaE of C. crescenrus . This 
invention also provides a method for identifying a candidate Caulobacter for such use, 
comprising extracting DNA from the Caulobacter . contacting the DNA with an 
20 oligonucleotide that is selectively hybridizable to one of rsaD and rsaE of 
C. crescenrus , and determining whether the oligonucleotide hybridizes to the DNA. 
The sequences of RsaD and RsaE and coding sequences rsaD and rsaE are known. 

This invention also provides a Caulobacter host, wherein the host comprises at 
25 least one surface layer transport protein having an amino acid sequence homologous 
to RsaD or RsaE. and wherein the host further comprises a DNA construct for 
expression of a polypeptide heterologous to a surface layer protein of the host, the 
construct comprising DNA encoding a heterologous polypeptide 5' to and operably 
linked with DNA encoding a Caulobacter surface layer protein secretion signal, with 
30 the proviso that when the host comprises transport proteins having sequences the same 
as both RsaD and RsaE, the secretion signal is not from C. crescenrus . 
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10 



This invention also provides a DNA construct comprising one or more 
restriction sites for facilitating insertion of DNA into the construct, wherein the 
construct further comprises DNA encoding a Caulobacter surfed ia yer protein 
secretion signal not present in C. crescenms . 

This invention also provides a DNA construct for expression of a heterologous 
polypeptide comprising DNA encoding a polypeptide not present in Caulobacter 
surface layer protein 5' from and operatively linked to DNA encoding a surface layer 
protein secretion signal not present in C. crescentus . 

A surface layer protein secretion signal not present in C. crescentus will 
function as such a signal in a Caulobacter type I secretion system but will not have an 
amino acid sequence that is the same as amino acids 945-1026 of the Rsa protein of 
C. crescentus. The laner sequence (SEQ ID NO: 1) is: 

APGAAVTLGAAATLAQYLDAAAAGDGSGTSVAXWFQFGGDTYVVVDSSAG 
ATFVSGADAVIKLTGLVTLTTSAFATEVLTLA 



This invention also provides a bacterial cell comprising the aforementioned 
20 DNA constructs. Where the bacterial cell is other than C. crescentus . the DNA 
construct may comprise a surface layer protein secretion signal derived from RsaA. 
This invention also provides the use of the aforementioned DNA constructs for 
transformation of bacterial cells and the use of such cells for expression and secretion 
of polypeptides heterologous to the cell. Where the cell is Caulobacter . the 
25 polypeptide is heterologous to the S-layer protein of the cell. This invention also 
provides proteins comprising heterologous material, secreted from a Caulobacter in 
which the secretion signal is not found in C. crescenms, . 



15 



WO 00/49163 



PCT/CA00/O0173 



Description of the Drawings 



For better understanding of this invention, reference may be made to the 
preferred embodiments and examples described below, and the accompanying 
5 drawing in which: 

Figure 1 shows the organization of the C. crescenrus genome with respect to 
the surface layer protein subunit gene (rsaA) and the downstream (3') type I transpon 
protein genes: rsaD (encodes the ABC - transporter), rsaE (encodes the membrane 
10 fusion protein (MPF) and rsaF (encodes the outer membrane protein OMP). LPS 
genes A-F are involved in the production of lipopolysaccharides. 

Description of the Preferred Embodiments 

15 Organisms for use in this invention include all S-layer producing freshwater 

species or strains of Caulobacter . WhiJe simiJarity of the S-layer gene and S-layer 
secretion systems permits the use of different S-layer protein producing freshwater 
Caulobacter in this invention, the C-terminal secretion signals of the S-layer genes of 
C. crescenrus strains CB2 and CB15 (and variants of those strains which contain 

2 0 homologs of the rsaA gene encoding the 1026 amino acid paracrystalline S-layer 
protein described in: Gilchrist, A. et_aJ. 1992 Can. J. Microbiol. 38:193-208) are 
specifically referred to in the detailed description and Examples set out below. 

Caulobacter strains that are incapable of forming an. S-layer, including those 
25 which shed the S-layer protein upon secretion, may be used in this invention. 
Examples are the S-layer negative mutants CB2A and CBlSAKSac described in 
Smit, J., and N. Agabian (1984) J. Bacterid. 160:1137-1145; and, Edwards, P., and 
J. Smit (1991) J. Bacterid. .173:5568-5572. Examples of shedding strains are 
CB15Ca5 and CB15Cal0 described in Edwards and Smit (1991). and the smooth 
30 lipopolysaccharide deficient mutants described in Walker. S.G. et ai. (1994) 
J. Bacteriol. 176:6312-6323. 
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A heterologous polypeptide as referred l0 herein ^ fc . 
poiypepnde. protein or a pan of a pro«ein which is desired ,o be expressed * 
»cter and which may be secreted by the bacterium. A polypeptide that is 
. -rologous l0 a surface layer protein of a Oujpb^ means . „ 
5 found ,„ a* surface ,ayer protetn r»,ive to .he Caulobacter in which «he heterologous 
polypeptide is expressed. 

Heterologous polypeptides include enzymes and other functional sequences of 
ammo acids as well as. ligands. antigens, antigenic epitopes and haptens The size of 
:o the heterologous polypeptide will be selected depending upon whether an taact 
S-layer ,s to be produced in the Caulobacter or whether the protein to be recovered 
from the baceriaj medium as described below. Heterologous polypeptides of about 
400 ammo acids have been expressed. Preferably, the cysteine content of a* 
heterologous polypeptide and the capacity for formauon of disulphide bonds within 
1 5 the chuneric pro.cn will be kept to a minimum to minimize dismption of the secredon 
of .he chimeric protem. However, the presence of cysteine residues capable of 
forming a disulphide bond which are relatively close together, ma, „„, affect 



secretion. 



20 This invention may be practised by implementing known methods for insertion 

of a selected heterologous coding sequence into all or part of an S-layer protein gene 
so that both the S-layer protein and the heterologous sequence are operably linked 
thereby permitting the S-layer protein and the heterologous sequence to be transcribed 
together and -in-frame". Sequencing of an S-layer protein gene permits one to 

.5 .denufy potential sites to install heterologous genetic material. The repetitive nature 
of the protein in the S-layer permits multiple copies of heterologous polypeptides to be 
expressed. 

The following general procedure lays out courses of action, with reference to 
3 0 particular plasmid vectors or constructions, tha, may be used to accomplish fusion of 
an S-Layer protein with a polypeptide of interest. The following description makes 
reference to the rsaA gene of C. crcscemus described by Gilchrist. A. eral (1992) 
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and in WO 97/34000. as an example. 

The general procedure includes detailed steps allowing for the following 
possibilities: 

(1) use of a collection of potentially permissive sites in the S-layer gene to 
install the genetic information for a polypeptide of interest; 

(2) use of a carrier cassette for delivering a gene of interest to sites within 
the S-layer gene; 

(3) creation of a collection of random insertion sites based on a restriction 
enzyme of choice, if the available collection of potentially permissive sites is for some 
reason unsuitable; and, 

(4) direct insertion of DNA coding for a polypeptide of interest into 
permissive sites. 

The general procedure involves the following steps and alternative courses of 
action. As a first step the practitioner may choose an appropriate region (or specific 
amino acid position) of the S-layer for insertion of a desired polypeptide. Second, the 
practitioner will create a unique restriction site (preferably hexameric) in the S-layer 
gene at a position within the gene encoding that region (or corresponding to a specific 
amino acid) using either standard linker mutagenesis (regional) or site directed 
mutagenesis (specific amino acid). The unique restriction site will act as a site for 
accepting DNA encoding the polypeptide of interest. For example, the plasmid-based 
promoter-less version of the rsaA gene (pTZ18U:rsaA P) described in Gilchrist; A. 
et_aj. (1992) may be used because it contains an appropriate combination of 5' and 3' 
restriction sites useful for subsequent steps. Preferably, the restriction site should not 
occur in the S-layer gene, its carrier plasmid or the DNA sequence coding for the 
polypeptide of interest. 



WO 00/49163 



- 10- 



PCT/CA00/00173 



If it is unclear which region of the S-layer would be suitable for insenion of a 
polypeptide of interest, a random linker mutagenesis approach may be used to 
randomly insert a unique linker-encoded restriction site (preferably hexameric) at 
various positions in the gene. Sites for insenion of the linker are created using an 
5 endonuclease, either of a sequence specific nature (e.g. tetrameric recognition^ 
restriction enzyme) or sequence non-specific nature (e.g. Deoxyribonucleic I 
[DNase II). A particularly suitable method is the generalized selectable linker 
mutagenesis approach described in Bingle, W.H., and J. Smit. (1991) Biotechniques 
10: 150-152, by which endonuclease digestion is carried out under panial digestion 
10 conditions and a library of linker insertions at different positions in a gene is created. 
Panial digestion with different endonucleases create potential sites for insenion of a 
linker. 

If restriction endonucleases are used to create sites for subsequent insenion cf 
15 a linker encoding a hexameric restriction site, mutagenesis may also be done with a 
mixture of 3 different linkers incorporating appropriate spacer nucleotides in order to 
obtain an insertion with proper reading frame at a particular restriction site. With 
DNase I, only one linker is needed, but only 1 of 3 linker insertions may be useful for 
accepting DNA encoding the polypeptide of interest depending on the position of the 
20 DNase 1 cleavage. 



25 



A linker tagged with a rharker may be used to insert DNA of interest at a 
restriction site. For example, if BamHI sites are appropriate as sites for the 
introduction of DNA encoding a polypeptide of interest, BamHI linkers tagged with a 
kanamycin-resistance gene for selectable linker mutagenesis may be used' One such 
12-bp linker carried in plasmid pUC1021K for use in rsaA was described by Bing it 
and Smit (1991). Two additional 15-bp linkers (pUC7165K and pTZ6571K) fcr 
creating 2 other possible translation frames within the linker insen itself axe described 
in Figures 3 and 4 of WO 97/34000. A mixture of three such linkers is preferably 
30 used for mutagenesis. 



a library composed of linker insertions encoding desired a hexameric 
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restriction site at different positions has been created, DNA encoding a polypeptide of 
interest may be inserted into the sites en masse . The library may be digested with the 
restriction enzyme specific for the newly-introduced linker encoded restriction site and 
ligated to a DNA fragment encoding the polypeptide of interest and carrying the 
appropriate complementary cohesive termini. DNA specifying the polypeptide of 
interest can be prepared by a number of standard methods, which may include 
oligonucleotide synthesis'of 2 anti-complementary strands, polymerase chain reaction 
(PCRj procedures, or addition of linkers whose termini are compatible with the 
introduced sites in the target gene to a suitably modified segment of DNA. 



10 



In order to facilitate the rapid recovery of genes carrying DNA inserted at 
restriction sites, a carrier oligonucleotide may be used. An example of the use of 
such a carrier, shown in Figure 1 of WO 97/34000, was designed to accept DNA 
prepared by PCR or by annealing synthesized oligonucleotides and controls direction 

15 of insertion of the foreign segment into a rsaA gene through use of a promoterless 
drug resistance marker. The DNA of interest is first directionally cloned, if possible, 
using the Xhol. StuI, or Sail sites or non-directionally cloned using any one of the 
sites in the same orientation as a promoterless chloramphenicol resistance (CmR) 
gene. To do this the DNA of interest may be provided with the appropriate termini 

20 for cloning and spacer nucleotides for maintaining correct reading frame within the 
cassette and should not contain a Belli site. For insertion into the BamHI linker 
library, the DNA of interest is recovered as a BamHI fragment tagged with a CmR 
gene. When ligated to the BamHI digested rsaA linker library, only those colonies of 
the bacterium (eg. E. coli ) used for the gene modification steps that are recovered will 

2 5 be those carrying insertions of the desired DNA in the correct orientation, since the 

promoter on the plasmid is 5' to rsaA P and the CmR gene. This eliminates screening 
for DNA introduction and increases the recovery of useful clones. While still 
manipulating the library as one unit, the CmR gene is removed using BgJII. The 
carrier oligonucleotide also provides the opportunity to add DNA 5' or 3' to the DNA 

3 0 of interest at SaJI. Xhol or StuI sites providing the DNA of interest does not contain 

any of these sites. This allows some control over spacing between rsaA sequences 
and the sequence of the DNA of interest. 
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Genes carrying the DNA of interest in the correct orientation may be excised 
from the plasmid and transferred to a suitable vector providing a promoter recoenized 
by Cautebacjer. Such vectors include pWB9 or P WB10 with EcoRI/Sstl sites, as 
5 described in Bingle, W.H., and J. Smit. (1990). Plasmid 24: 143-148. The DNA of 
interest should not contain the same restriction sites present in the vector. This allows 
expression of the hybrids in S-layer negative mutants of Caulobacter . 

Caul0bacter su ™ vin g *™fcr are examined for chimeric protein secretion, 
10 and optionally S-layer assembly or presentation of the new polypeptide activity,' 
antigenicity, etc. on the cell, by methods specific to the needs of the investigator or 
the capabilities of the inserted sequence. Many of the sites created are "benign" as 
they have-no effect on the functional regions of the protein involved with export, self 
assembly, etc. However, not every site that results -in an absence of functional 
1 5 disruption of the S-layer is best for insertion of new activities. Some sites may not be 
well exposed on the surface of the organism and other sites may not tolerate insertion 
of much more DNA than the linker sequence. 

It is possible to express single or multiple insertions of heterologous 
2 0 polypeptides in a S-layer chimeric protein which will still assemble as an S-layer on 
the cell surface. Some sites may be sensitive to even small insertions resulting in the 
chimeric protein being released into the medium. Release may also be deliberately 
effected by use of a shedding strain of Caulobacter to express the chimeric protein or 
by physical removal of the S-layer from whole cells. Where S-layer assembly is not 
2 5 required, quite large polypeptides may be expressed as part of the S-layer protein. 
Expressing, a chimeric protein containing a S-layer protein component having 
substantial deletions, may increase the size of the heterologous polypeptides that will 
be expressed and secreted by Caulobacter . 



30 



The preceding methods describe insertion of linkers in-frame into a 
promoterless version of the S-layer gene. The sites that are introduced allow 
subsequent insertion of foreign DNA in-frame into the full length gene. This 
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invention also involves the construction of chimeric S-layer protein genes and the 
resulting production of chimeric S-layer proteins in which the S-layer gene component 
lacks large portions of the gene. This reduces the amount of Caulobacter protein 
present in the secreted chimeric protein. Generally, large deletions throughout the 
3 S-layer gene will result in a chimeric protein that is not capable of forming an S-layer. 
Attachment of the S-layer to the cell is abolished if the N-terminal amino acids which 
contribute to S-layer formation are deleted. For example, deletion of the first 
29 N-terminal amino acids of the RsaA protein will accomplish this. Also absence of 
the first 776 amino acids from the N-terminal region of RsaA will result in a chimeric 

10 protein secreted from the cell but having a S-layer component consisting of only the 
250 C-terminal amino acids of RsaA. Since only the extreme C-terminal region 
corresponding to approximately amino acids 945-1026 of RsaA is required for 
secretion of an S-layer chimeric protein from C. crescentus . use of only the 
C terminal secretion signal will prevent S-layer formation. Furthermore, use of only 

15 the C-terminal region promotes spontaneous aggregation of much of the secreted 
chimeric protein in the cell medium and formation of a macroscopic precipitate that 
may be collected with a course mesh or sheared to micron-sized panicles. Yields of 
up to 250 mg. (dry weight) of protein per liter of cells may be possible. 

20 Sequence analysis of the 3' region of the S-layer genes from different strains 

of C. crescentus shows that the portion of the gene encoding the C-terminal region of 
the S-layer protein is highly conserved whhin the species. It has now been 
determined that while there is moderate variability in the sequence of surface layer 
proteins (including the secretion signal) from different species of freshwater 

2 5 Caulobacter , there is an unusually high sequence conservation among different 

Caulobacter s P ecies for ABC-transponer protein and the membrane fusion protein. 
Sequence analysis of CB15 and CB2A (readily distinguishable strains of 
C. crescentus ) shows identical DNA sequences coding for the last 118 amino acids of 
the RsaA protein (which includes the secretion signal) and sequencing of the next 

3 0 downstream translated gene (rsaD) to amino acid 97 of the gene product shows only a 

single base pair change, resulting in a conservative amino acid substitution in the 
ABC transporter protein. Sequence analysis of surface layer protein genes and the 
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transporter pro™ genes in spccies other ^ g 3^ ^ ^ 
.he secretion si gna .s between species as compared ,c strain of C. crescemus 
However, a much higher leve. of inter-species conservation «i SU w im rel^e 
transporter proteins, even (as is the case w ilh some species of Cau.obacter . when * 

5 goners are no, located in^edi.tCy downstream from the surface layer pro«in 
gene. r 

U having now been demonstrated «ha, species of Caulobacter other a™ 
C. crescenrus employ a C-terminal secretion signal for the surface layer protein and 
10 contain highly conserved transport proteins, the procedures described herein or are 
known in the an. may be readily employed for use surface layer protein secre,io„ 
s.gnals from Caulobacter other than C. crescemus and to identify and use species 
°<her than C. crescemm as a host for expression of heterologous polypeptides. 

15 The moderate inter-species conservation of surface layer protein genes 

(particularly for glycine-aspanie acid rich regions of the protein) may be exploited for 
locating a S-layer protein gene in a candidate Caulobacter . using known methc* 
Alternatively, the gene may be located by searching for a sequence hybridal, to a 
sequence derived from an anuno acid sequence which is determined by sequencing the 

2 0 S-layer protein secreted by the candidate cell, using methods in the known art. 

The minimal amino acid tract from a Caulobacter Out constitutes the essential 
surface layer protein secretion signal may be determined by the procedures described 
herein or by methods known in the an. One approach is to identify regions from 
2 5 S-layer genes of a Caulobacter which code for amino acid sequences that exhibit some 
■dennty to the las. 82C-erminal restdues of the RsaA protein of C crescemus 
Homology to upstream sequences in the protein may also be assessed. Another 
approach is to delete NMerminal amino acids from the surface layer protein until 
secretion is lost. 
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Caulobacter other than C. crescemus may be screened for suitability as hosts 
for expression and secretion of heterologous polypeptides by defining whether * 
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candidate cell has a gene or gene product which exhibits sufficient identity to the rsaD 
or rsaE genes or RsaD or RsaE proteins from C. crescentus . This may be 
conveniently accomplished by determining whether a oligonucleotide probe based on 
me rsaD or rsaE gene sequences will selectively hybridize to DNA from the candidate 
5 cell. The probe is prepared by any means for construction of an oligonucleotide and 
will preferably have a sequence that is homologous to all or pan of rsaD or rsaE. 
The probe will consist of at least 20. more preferably at least 30, more preferably at 
least 40, and even more preferably at least 50 nucleotides. The probe may be used 
for amplification by known procedures (eg. by PCR) of target DNA or may be 

10 labelled for direct determination of the presence of target DNA by known procedures. 
Labels include radio-labels, fluorescent labels, etc. Detection of target DNA may be 
accomplished through various standard techniques such as Southern blotting, in-situ 
hybridization, etc. Caulobacter other than C. crescentus are useful as host organisms 
for expression and secretion of heterologous polypeptides when the host contains a 

15 transport protein that is homologous to either the RsaD or RsaE proteins of 
C. crescentus . 

An amino acid or nucleic acid sequence is 'homologous- to another such 
sequence if the two sequences axe substantially identical and the function of the 

20 sequences is conserved (for example, both sequences function as or encode a secretion 
signal or a transport protein functional in Caulobacter ). Two amino acid or nucleic 
acid sequences are considered substantially identical if they share at least about 70S 
sequence identity, preferably at least about 80% sequence identity, more preferably at 
least about 90% sequence identity. Sequence identity may be determined using the 

25 BLAST algorithm, described in Altschul et_aL (1990). J. Mol. Biol. 215:403-10 
(using the published default settings). In such circumstances, percentage of sequence 
identity may be expressed as a "homology" of the same percentage. 

An alternative indication that two nucleic acid sequences are homologous 
30 (substantially identical) is when rwo sequences selectively hybridize to each other 
under at least moderately stringent conditions. Hybridization to filter-bound 
sequences under moderately stringent conditions may, for example, be performed in 
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0. 5 M NaHPO., 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65-C and 
washing in 0.2 x SSC/0,.% SDS.a, 42-C (xe Ausubel ^ ^ m9 ' 
Protocols in Molecular Bioln,v , Vol. 1, Green Publishing Associates, he and John 
Wiley A Sons. Inc., New York, a, page 2, 10 .3>. Higher sequence identity is 

5 demonstrated by hybridization to futer-bound sequences under stringent conditions 
which may (for example) be performed in 0.5 M NaHPO., 1% SDS. 1 mM EDTA a> 
65<, and washing in 0.1 x SSC/0.1 % SDS a, 68-C (see Ausubel, ail. (eds) 1989) 
Hybridization conditions ;may be modified in accordance with known methods 
dependmg on the sequence of interest (see Tijssen, 1993. Laboratory Ter^ ,,.. „ 
10 B'°"""istrv a"" Molecular Biology ■ Hyb ridization with NurUir pan 

1. Chapter 2 "Overview of Principles of Hybridization and the Strategy of Nucleic 
Acid Probe Assays". Elsevier. New York). Generally, stringent conditions are 
selected to be about 5-C lower tan the thermal melting poin, for the specific 
sequence at a defined ionic strength and pH. 

15 

In this invention, screening of Caulobacter for use as a host organism 
according to transport protein sequence identity may involve the use of 
oligonucleotide probes designed to selectively hybridize to target DNA, if the target 
contains DNA that is homologous (substantially identical) to all or pan of rsaD or 

20 rsaE. Caulobacter of this invention comprise DNA encoding a transport protein that 
is homologous to either rsaD or rsaE. However, as disclosed herein, surface layer 
protein secretion signals useful in this invention may not exhibit such hieh identity to 
the secretion signal of the rsaA gene. The level of identity to RsaA or rsaA sequences 
might be less than 50%, which is lower than that required for "homology" as defined 

25 above. In such cases, the secretion signal may be solely defined according to is 
ability to effect transport of a protein of which the signal is the C-terminus. through 
the type 1 secretory system of a Caulobacter. The presence or absence of this function 
may be readily determined by monitoring extra-cellular occurrence of a protein of 
interest using known means or the procedures described herein. 

30 

Expression of heterologous polypeptides may be practised by use of modified 
S-layer genes borne on plasmids which may be readily constructed and introduced to 
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10 



Caulobacter b y electroponation. Typically, the plasmid is maintained in the 
Caulobacter by antibiotic selection. Highly modified S-layer genes with attached 
heterologous sequences may also be introduced into Caulobacter on a plasmid that is 
not replicated by Caulobacter since homologous recombination of the incoming 
modified S-layer gene with the chromosome-resident copy of the S-layer gene in the 
cell will often occur at a low but practicable frequency resulting in a gene rescue or 
transfer event. In some cases it may be desirable to obtain a stable cell line in which 
the chimeric S-layer gene is chromosomal. Various protocols for creating 
chromosomal insertions are set out in the Examples. 



Use of Caulobacter S-layer protein as a vehicle for production of a 
heterologous polypeptide has several advantages. Firstly, the S-layer protein is 
synthesized in large quantities and has a generally repetitive sequence. . This permits 
the development of systems for synthesis of a relatively large amount of heterologous 

15 material as a fusion product with an S-layer protein (chimeric protein). It may be 
desirable to retain the chimeric protein as part of the bacterial cell envelope or, the 
fusion product may be separated from the organism, such as by the method described 
in: Walker. S.G., eral. (1992) J. Bacterid. 174:1783-1792. Alternatively, the 
Caulobacter strain that is used to express the fusion product may be derived from a 

20 strain such as CB15Ca5 that sheds its S-layer. 



Caulobacter are particularly suited for use in bioreactor systems. An example 
would be the use of a modified Caulobacter to treat sewage, waste water etc. 
Caulobacters are ideal candidates for fixed-cell bioreactors, the construction of which 

25 is well-known (eg. rotating biological contactors). Other bacteria often produce 
copious polysaccharide slimes that quickly plug filtration systems. In some cases, 
other bacteria are not surface-adherent. By taking advantage of the natural bio-film 
forming characteristics of Caulobacter . bioreactors may be formed comprising a 
substrate and a single layer of cells adhered thereon, with the cells distributed at high 

30 density. A variety of substrates may be used such as a column of chemically 
derivatized glass beads or. a porous ceramic material such as ceramic foam. 
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Another application is in the production of batch cultures of modified 
Caulobacter wherein the S-layer protein is a fusion product with an enzyme. For 
example, such Caulobacter could be grown in wood pulp suspensions at an 
appropriate juncture of the pulping process in order to provide for enzymatic 
5 decomposition of the wood-pulp structure. 

Examples of enzymes that may be expressed as chimeric S-layer proteins 
include alkaline phosphatase (eg. by expression of the pho A gene of Ecoli; 
see: Hoffman, C.S., and Wright, A; (1985) Proc. Natl. Acad. Sci. U.S.A. 
10 82:5107-5111; Bingle, W.H;, et_al. (1993) Can. J. Microbiol.39: 70-80; and, Bingle. 
W.H. and Smit, J. (1994) Can. J. Microbiol. 40:777-782.) and, cellulase (eg. by 
expression of the CenA gene of Cellulomonas fimi ; see: Bingle, W.H. et al. (1993). 
and, Bingle, W.H. and Smit, J. (1994). 

15 Another application is the production of organisms that secrete and optionally 

present vaccine-candidate epitopes. Modified Caulobacter may be readily cultured in 
outdoor freshwater environments and would be particularly useful as fish vaccines. 
The two-dimensional crystalline array of the S-protein layer of Caulobacter . which 
has a geometrically regular, repetitive structure, provides an ideal means for dense 

2 0 packing and presentation of an epitope as pan of an intact S-layer on the bacterial cell 
surface. 



Polypeptides secieted by Caulobacter may be harvested in large quantities, 
relatively free of contaminants and protein of host cell origin. Expression of a 

2 5 heterologous polypeptide fused with sufficient C-terminal amino acids of the S-layer 

protein to promote secretion of the protein results in the accumulation of large 
quantities of secreted product in the cell medium. The chimeric protein does not have 
to be released from the cell surface, but adjustment of the size of the S-layer protein 
portion can dictate whether the secreted chimeric protein is soluble or will precipitate 

3 0 in the cell medium. This is useful in cases where the Caulobacter is used to express a 

foreign antigenic component and it is desired to minimize the amount of host cell 
protein associated with the antigen. 
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Example 1 : Producti on of Permissive Insertion Sites in C. crescentus 

Using the restriction enzyme TagI, a partial digestion of the rsaA gene in 
5 P TZ18U:rsaA P produced a group of linearized segments with random TagI sites 
cleaved. The linearized segments were modified by use of the tagged linker 
mutagenesis procedure of Single and Smit (1991), using the 12-bp BamHI linker 
carried in plasmid pUC102K discussed in the general procedure above. Those 
products that produced a full-length protein in E. coli were ultimately transferred to 

10 pWBI (a minor variation of pWB9 that is replicated by Caulpbacter), as described in 
the general procedure. The resulting construction was introduced into a C. crescentus 
strain. Distinguishable events were retrieved and analyzed for the ability to produce a 
full-length protein in C. crescentus and to produce the crystalline S-layer on their 
surface and the approximate location of the insertion. Cells were screened for the 

15 presence of a S-layer protein of approximately lOOkDa that is extracted from the 
surface of whole cells by 100 mM HEPES at ph2. The results of this screening 
resulted in five successful events. 

The five positive events represented cases where a 4-amino acid insertion was 
20 tolerated with no effect on the S-layer function. The S-layers of the modified 
Caulobacter were indistinguishable from a wild-type S-layer. By producing 
3 versions of the gene of interest, representing each possible reading frame (using 
standard linker addition technology), one may test each of these sites for suitability in 
expressing the desired activity. Also, by using restriction enzymes other than TagI 

2 5 (such as Aril, HinPl or MspT) a larger library of BamHI insertions may be created. 

Example 2: Investigation of Other Permissive Sites in rsaA Gene 

A library of 240 BamHI linker insertions was created using the procedures of 

3 0 Example 1. Of the 240 insertions, 45 target sites in the rsaA gene were made with 

TagI. 34 of the latter insertions were discarded because the clones contained deletions 
of rsaA DNA as well as the linker insertions. The remaining 11 resulted in 
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5 non-permissive and the 5 permissive sites found in Example 1. The remaining 
195 insertions in the library were made using the enzymes HinPI, Acil, and Mspj to 
create target sites as outlined in Example 1. Of the latter 195 insertions. 
49 permissive sites were located for a total of 55. Of those sites scored as 
5 non-permissive, some may have had deletions of rsaA DNA at the linker insertion 
site. One BamHI linker insertion at a TaqI site thought to be permissive was later 
found by nucleotide sequencing to be located outside the rsaA structural gene reducing 
the total number of permissive. sites to 54 from 55. The results show that sites that 
will accept 2-4 amino acids while still allowing the protein to be made and assembled 
10 into an S-layer are scattered up and down the protein. There is a high proportion of 
sites at which such insertions- do not prevent expression and assembly of the S-layer. 
Approximately 25-50% of in-frame linker insertions will be tolerated by the S-layer 
protein and the Caulobacter and that diverse regions of the protein will tolerate 
insertions. 

15 

Example 3: Studies with Cadmium Binding Polypep tides 

Following the foregoing procedures, single and multiple copies of DNA 
encoding a synthetic cadmium binding peptide were synthesized, inserted at the amino 
2 0 acid 277 site of rsaA using the above described Carrier cassette, and expressed in 
C. crescentus . The peptide has a single cysteine residue. Mild acid extracts of whole 
cells expressing the modified gene were subjected to SDS-PAGE for identification of 
S-layer proteins. The S-layer protein was expressed and secreted when there was 
from 1 to 3 copies of the cadmium binding peptide present at RsaA amino acid 

2 5 position 277. Insertion of 4 or more copies resulted in a dramatic reduction of S-layer 

protein released from the whole cells by mild acid treatment to barely detectable 
levels. Detection by autoradiography of RsaA protein in vivo labelled with *S- 
cysteine and invitro with ns I- iodoacetamide confirmed that the cadmium binding 
peptide was pan of the chimeric protein. This demonstrates that C. crescentus is 

3 0 capable of secretion of a chimeric rsaA protein having a limited cysteine content and a 

limited capacity for disulphide bond formation but that increased capacity of 
disulphide bond formation will limit production. 
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Example 4: Expression and P resentation of Antigenic Epitopes on Caulobacter fvn 
Surface 



5 Using the library of the 49 permissive sites other than those made with Tag! 

described in Example 2, the coding sequence for a 12-arnino acid pilus peptide 
epitope lacking cysteine residues from Pseudomonas aeruginosa PAK pilus (described 
in Figure 8 of WO 97/34000) was inserted at the sites using the procedures described 
above, employing the carrier cassette described above. Positioning of the inserted 
1 0 DNA berween the first Bam HI site and the BgJ II site permitted use of the latter site 
for making repeated insertions of DNA. DNA coding for the PAK pilus peptide was 
prepared by oligonucleotide synthesis of two anti-complementary strands. 

The transformed bacteria were screened for both production and presentation 
15 of the epitopes by the transformed Caulobacter using standard Western immunoblot 
analysis (see: Bumette, W. N. (1981) Analytical Biochemistry 112:195-203) and by 
colony immunoblot tests in which the cells were not disrupted (see: Engleberg, N.C., 
et_al. (1984) Infection and Immunity 44:222-227). Anti-pilus monoclonal antibody 
(PK99H) obtained from Dr. Irvin, Dept. of Microbiology, University of AJbena. 
2 0 Canada was used in the immunoblot analyses to detect the presence of the pilus 
epitope insert. The antibody was prepared using purified Pseudomonas aeruginosa 
PAK pilus as the antigen and a monoclonal antibody was isolated by standard 
techniques using BALB/C mice as a source of ascites fluid. Reaction with the 
antibody in a whole cell colony immunoblot assay showed that the epitope is not only 
2 5 expressed in the transformed Caulobacter but is exposed on the S-layer surface 
overlying the cell in such a way that the epitope is available to the antibody. When 
two cysteine residues of the pilin epitope were incorporated in the chimeric protein, 
the protein was still expressed and secreted at normal levels. 



30 



Of the organisms screened, insertions of the pilus epitope at the following sites 
in the rsaA gene as determined by nucleotide sequencing resulted in a positive 
reaction with the antibody in the whole cell Colony immunoblot analysis: 69, 277, 
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353. 450, 485, 467, 551, 574, 622, 690. 723, and 944. The results show that the 
permissive sites that will accept polypeptides of the size of the epitope are numerous 
and scattered across the gene. 

5 Further studies with the pilus peptide resulted in successful expression and 

secretion of chimeric proteins having single copies of the peptide at various other 
locations. Also, four and seven copies of the peptide were expressed and secreted as 
a RsaA chimeric protein when inserted at amino acids 277 and 551 respectively of the 
RsaA protein. However, insertions of the peptide at amino acids 69, 277, 450, 551 
10 and 622 resulted in a chimeric protein that did not attach to the cell surface and was 
released into the culture medium. 

Example 5: Insertion of Large Polypeptides 

15 Bacterial surface proteins from organisms other than Caulobacter are generally 

not known to accept polypeptides larger than about 60 amino acids within the 
structure of the surface protein. The procedures of the preceding Example were 
carried out in order to insert the coding sequence of a 109 amino acid epitope from 
1HNV virus coat glycoprotein at the same insertion sites. The IHNV epitope was 

2 0 prepared by PCR and had a sequence as shown in Figure 9 of WO 97/34000, which is 
equivalent to amino acid residues 336-444 of the IHNV sequence described 
in: Koener, J.F. et al . (1987) Journal of Virology 61:1342-1349. Anti-IHNV 
polyclonal antibody against whole IHNV obtained from Dr. Joann Leong, Dept. of 
Microbiology, Oregon State University, U.S.A. (see: Xu, L. etal. (1991) Journal of 

25 Virology 65: 161 1-1615), was used in immunoblot assays as described above to screen 
for Caulobacter that express and present the IHNV sequence on the cell surface. 
Reaction in the whole cell colony immunoblot assay was positive in respect of 
insertions at sites 450 and 551, and negative at a site which was at approximately 
amino acid 585. 

30 



The EHNV insert contains a single cysteine residue and is an exuemely large 
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insert for successful expression as a fusion product with a bacterial surface protein. 
In further studies, the same 109 amino acid portion of the IHNV glycoprotein was 
inserted at amino acid 450 of the RsaA protein. The orotein serr^H h„ r 
was recovered from the cell culture medium. SDS-PAGE analysis showed that some 
5 of the protein was smaller than the predicted rsaA chimeric protein but still bound the 
anti-IHNV antibody. Analysis of these proteolytic products showed that cleavage of 
the chimeric protein occurred at an Arg residue encoded by the gene transfer cassette. 
Thus in some cases, adjustment of the nucleotide sequence at the interface of the 
polypeptide and rsaA coding sequences may be necessary to prevent expression of an 
10 arginine residue. 

Example 6: 

Methods are described above for the insertion of 12-bp BamHI linker sites into 
15 a promoterless version of the rsaA gene. Because linker insertions involve the 
insertion of 12 bp (a multiple of three), an in-frame linker insertion resulted in every 
case. These linker sites are introduced to allow subsequent insertion of DNA 
encoding foreign peptide/proteins. Expression of such chimeric genes leads to the 
production of an entire full-length RsaA protein carrying the inserted heterologous 
20 amino acid sequence of interest. A number of BamHI site positions were identified 
above precisely by nucleotide sequencing. Four of the sites in the rsaA gene 
correspond to amino acid positions 188, 782, 905. 944 in the RsaA protein. For this 
example, an additional linker insertion was created at amino acid position 95 of the 
native gene (i.e. this gene carried its own promoter) using the same methodology. All 
25 five in-frame BamHI linker insertion sites were inserted in the rsaA so that the 
nucleotides of the linker DNA were read in the reading frame GGA/TCC. 

Because all BamHI linker nucleotides were read in the same reading frame, 
the 5' region of one rsaA gene carrying a BamHI linker insertion at one position 
30 could be combined with the 3* region of an rsaA gene carrying another of the Bam HI 
linker insertions to create in-frame deletions with a BamHI site at the joint between 
adjacent regions of rsaA. Using such a method, in-frame deletions of rsaA 
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( AA95-782) and rsaA( AA1 88-782) were created. 

DNA fragments encoding various C-terminal portions of the 1026 amino acid 
RsaA protein were isolated using the newly inserted BamHI linker sites as the 
5 5' terminus of the fragment and a Hindm site as the 3' terminus of the fragment. 
These BamHI fragments were transferred to the BamHI/Hindin sites of pUC8 
(J. Vieira, and J. Messing. (1982) Gene 19:259-268) creating rsaA C-terminal 
segment carrier plasmids (see Figure 12 of WO 97/34000). The insertion into pUC8 
also resulted in the creation of an in-frame fusion between the first 10 N-terminal 
0 amino acids of LacZa and the various C-terminal fragments (AA782-1026, 
AA905-1026 or AA944-1026) of RsaA. These LacZarrsaA fusion proteins can be 
produced in Caulobacter using the IacZa transcription/translation initiation signals 
when introduced on appropriate plasmid vectors or direct insertion into the 
chromosome (see: W.H. Bingle, etal. (1993) Can. J. Microbiol. 39:70-80). 

5 

Both types of construction, the deletion versions and the C-terminal only 
segments, resulted in the production of proteins secreted by the Caulobacter as highly 
modified S-layer proteins. The gene segments can also facilitate the secretion of 
heterologous polypeptides by insertion or fusion of appropriate DNA sequences at the 
0 unique BamHI site that exists in each of the constructions, as described below. 

A- Creating Fusions of Desired Sequences with C-terminal Portions of a Caulobacter 
S-layer Gene -Method 1 

5 The process may be as follows: 

W Inserting the desired sequence into the Carrier cassette. Heterologous sequences 
may be introduced into a carrier by: 

0 (a) Insertion of a single copy of the desired gene segment. 

Depending upon the length of a gene segment, two methods of construction 
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may be used. For segments of up to about 30 amino acids, two oligonucleotides of 
appropriate sequence may be chemically synthesized, annealed by mixing, heating and 
slow cooling and ligated into the carrier cassette. The nlionnnrUntiH.. ».;n 
contain additional base pairs that recreate "sticky ends" of appropriate restriction 
endonuclease sites at each end of the duplex DNA that results from the annealing 
process. 



For longer segments, PCR may be used to amplify a region of a target DNA 
sequence. Oligonucleotides are synthesized that have sequence complementary to the 
10 boundaries of the desired sequence and which contain additional base pairs that 
recreate a "sticky end" of an appropriate restriction endonuclease site. In the present 
example oligonucleotides are made to produce products with the appropriate 
restriction endonuclease site for directional cloning into the carrier cassette. PCR 
amplification of the desired sequence is then done by standard methods. 

15 

For each method, sticky ends must be appropriate for restriction sites at the 
5" terminus and the 3' terminus. This places the desired gene segment in the correct 
orientation within the carrier cassette. Reading frame continuity is maintained by- 
appropriate design of the oligonucleotides used for the PCR step. 



20 



(b) Preparation of multiple copies of the desired gene segment. 



The carrier cassette also allows for production of multiple insert copies. For 
example, a restriction site in the "cassette may be restored after removal of a 

2 5 promoterless antibiotic resistance gene and the site is then used to insert an additional 
copy as described in WO 97/34000. This "piggy-back" insertion still maintains the 
correct reading frame throughout the construction. Any number of additional cycles 
of "piggy-backing" can be done because the ligation results in a sequence which is no 
longer a substrate for the restriction enzymes. The result is the production of 

30 cassettes of multiple copies of the desired sequence which can be transferred to 
appropriately modified S-layer protein genes with the same ease as a single copy. An 
additional feature of this method is that different heterologous sequences can be paired 
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together in this multiple copy cassette with, the same ease as multiple copies of the 
same heterologous sequence. 

Exam P le 6a: Insenion of ™ 109 amino acid segment of the IHNV surface 
5 glycoprotein to Carrier cassette. 



A PCR product was made that contained the DNA coding for amino acids 336 
to 444 of the major surface glycoprotein of the Infectious Hematopoietic Necrosis 
Virus (IHNV), as described in WO 97/34000. 

10 

Example 6b: Insenion of an 184 amino acid segment of the IHNV surface 
glycoprciein to Carrier cassette. 

A PCR product was made that contained the DNA coding for amino acids 270 
15 to 453 of the IHNV glycoprotein segment. 

Example 6c: Insertion of single and multiple copies and an epitope of the 
Pseudomonas aeruginosa PAK pilus gene to Carrier cassette. 

20 Oligonucleotides were constructed to code for the pilus epitope described in 

Example 4. Using the methods outlined in pan A(l)(b) of this Example. 3 tandem 
copies were prepared. 

(2) Transfer of Carrier Cassette to C-temiinal Segment Carrier Pbcmirfc 
25 constructs described in Examples 6a and 6b were transferred to a rsaA C-terminai 
Segment carrier plasmid, as described above, resulting in an in-frame fusion of: (a) a 
10 amino acid section of the p-galactosidase protein, (b) the desired sequence flanked 
by 2-3 amino acids derived from the carrier cassette sequence, and (c) the appropriate 
rsaA C-terminal segment. In some cases, the first codon of the rsaA C-terminal 
3 0 segment is converted to a different codon as a result of the fusion. For example, 
while the rsaA C-terminal segment may have coded for amino acids 944-1026 of 
RsaA, the resulting chimeric protein may only have amino acids 945-1026 native to 
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Exam P' e M: Fusion of «rrier/109 AA and 184 IHNV segments to C-ierminal rsaA 
segment AA782-1026. 

5 

This was done using the carrier cassettes described in Examples 6a and 6b and 
the AA782-1026 rsaA C-terminal segment carrier plasmid described above. 

Example 6e: Fusion of Carrier/109 AA and 184 AA IHNV segments to C-terminal 
10 rsaA segment AA905-1026. 

This was done using the carrier cassettes described in Examples 6a and 6b and 
the AA905-1026 rsaA C-terminaJ segment carrier plasmid described above. 

15 Example 6f: Fusion of Carrier/109 AA and 184 AA IHNV segments to C-terminal 
rsaA segment AA944-1026. 

This was done using the carrier cassettes described in Examples 6a and 6b and 
the AA944-1026 rsaA C-terminal segment carrier plasmid described above. 

20 

Exam P le 6 ° : Fusion of Carrier/3x Pilus Epitope Segment to C-terminal rsaA 
Segment AA782-1026. 

This was done using the carrier cassettes described in Example 6c and the 
2 5 AA782- 1026 rsaA C-terminal segment carrier plasmid described above. 

(3) Expression of the Desired Fu sion in an Appropriate Caulohacter Host Strain , 
(a) Plasmid-based expression. 

To create plasmid vectors that can be introduced and maintained in 
Caulobacter , an entire C-terminal segment carrier plasmid may be fused to a broad 



10 
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host range vector such as pKT215 or pKT210 (sec: M. Bagdasarian, etjj. (1981) 
Gene 16:237-247) using the unique Hindin restriction site present in each plasmid. 
The resulting plasmid is introduced into Caulobacter by conjugation or electroporation 
methods and is maintained by appropriate antibiotic selection. 

5 

The fusions described in Examples 6d-6g were expressed in C. crescentus . In 
each case expression and secretion of the chimeric S-layer protein was detected by 
Western immunoblot analysis of electrophoretic gels of the cell culture supermutani 
employing the monoclonal antibody for each of the polypeptide epitopes. The 
transporter signal is located to amino acids 945-1026 of the S-layer protein since all 
the chimeric proteins in the Examples were secreted. Precipitation of the chimeric 
protein occurred with the use of rsaA segment AA78M026 but not AA944-1026. 
Recovery of precipitate using AA9O5-1026 was reduced as compared to AA78M 026. 

1 5 G>)' Selection of appropriate C. crescentus host strains. 

It is often desirable to use a S-layer negative host strain such as CB2A or 
CB15aKSac. If it is important to ensure that the fusion protein is not attached to the 
cell surface, the use C. crescentus strains CB15Ca5KSac or CB15Cal0KSac may be 
2 0 appropriate. The latter strains have additional mutations that result in the loss of the 
production of a specific species of surface ^polysaccharide that has been 
demonstrated to be involved with the surface attachment of native S-layer protein as a 
2-dimensionaI crystalline array (see: Walker S.G. et_al. (1994) J. Bacterid. 
176:6312-6323). With highly modified versions of an S-layer gene, this provision is 

2 5 not necessary since virtually all regions of the gene that may have a role in the 

attachment process will be absent. 

An example of a growth media well suited to both propagation of Caulobacter 
for general purposes (including cloning steps) and also to produce the secreted and 

3 0 aggregated chimeric proteins is PYE medium, a peptone and yeast extract based 

medium described in Walker et al. . (1994). 



WO 00/49163 * n _ 

" O * PCT/CAOO/00173 

This invention also provides a DNA construct comprising one or more 
restriction sites for facilitating insertion of DNA into the construct, wherein the 
construct further comprises DNA encoding a_Caulpbacter_surface layer protein 
secretion signai not present in C. crescenrus . 

5 

This invention also provides a DNA construct for expression of a heterologous 
polypeptide comprising .DNA encoding a polypeptide not present in Caulobacter 
surface layer protein 5' from and operatively linked to DNA encoding a surface layer 
protein secretion signal not present in C. crescentus . 
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15 



A surface layer protein secretion signal not present in C. crescentus will 
function as such a signal in a Caulobacter type: I secretion system but will not have an 
amino acid sequence that is the same as amino acids 945-1026 of the Rsa protein of 
C. crescenrus . The latter sequence (SEQ ID NO:l) is: 

AFGAAVTLGAAATLAQYLDAAAAGDGSGTSVAKWFQFGGDTYVVVDSSAG 
ATFVSGADAVIKLTGLVTLTTSAFATEVLTLA 



This invention also provides a bacterial cell comprising the aforementioned 
2 0 DNA constructs. Where the bacterial cell is other than C. crescentus . the DNA 
construct may comprise a surface layer protein secretion signal derived from RsaA. 
This invention also provides the use of the aforementioned DNA constructs for 
transformation of bacterial cells and the use of such cells for expression and secretion 
of polypeptides heterologous to the cell. Where the cell is Caulobacter . the 
2 5 polypeptide is heterologous to the S-layer protein of the cell. This invention also 
provides proteins comprising heterologous material, secreted from a Caulobacter in 
which the secretion signal is not found in C. crescentus. 
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signal from a first species of Caulobacter will be recognized by the iranspon 
mechanism of other species. Thus, a surface layer protein secretion signal derived 
from any freshwater S-layer producing Caulobacter may be used in the invention 
described in WO 97/34000. As well, any Caulobacter which contains a type I 
5 secretion system may be used as a host organism for the expression and secretion of 
heterologous polypeptides to which a Caulobacter S-laver protein secretion signal has 
been fused. Nucleic acid constructs made for expression of heterologous polypeptides 
may include a surface layer- protein secretion signal from a Caulobacter other than 
C. crescentus , for expression in the same species from which the surface layer protein 
10 signal was derived or for expression in a different species. Furthermore, a C-terminal 
secretion signal derived from the S-layer protein (RsaA) of C. crescentus . may be 
used in such transformation of Caulobacter other than C. crescentus . 

This invenuon also provides the use of Caulobacter other than C. crescentus as 
15 a host organism for the expression of polypeptides heterologous to a surface layer 
protein of the Caulobacter, wherein the Caulobacter has at least one surface layer 
transport protein that is homologous to RsaD or Rsa£ of C. crescentus . This 
invention also provides a method for identifying a candidate Caulobacter for such use, 
comprising extracting DNA from the Caulobacter . contacting the DNA with an 
20 oligonucleotide that is selectively hybridizable to one of rsaD and rsaE of 
C. crescentus, and determining whether the oligonucleotide hybridizes to the DNA. 
The sequences of RsaD and RsaE and coding sequences rsaD and rsaE are known. 

This invenuon also provides a Caulobacter host, wherein the host comprises at 

2 5 least one surface layer transport protein having an amino acid sequence homologous 

to RsaD or RsaE. and wherein the host further comprises a DNA construct for 
expression of a polypeptide heterologous to a surface layer protein of the host, the 
construct comprising DNA encoding a heterologous polypeptide 5' to and operably 
linked with DNA encoding a Caulobacter surface layer protein secretion signal, with 

3 0 the proviso that when the host comprises transport proteins having sequences the same 

as both RsaD and RsaE, the secretion signal is not from C. crescentus . 
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under WO 97/34000 describes the C-terminal region of Caulobacter crescentus 
S-layer protein as being essential for secretion of S-layer protein in that species. 
Heterologous polypeptides may be conveniently expressed and secreted by a host 
Caulobacter when the polypeptide is expressed as a fusion with the C-terminal 
5 secretion signal. Further studies with C. crescentus have demonstrated that the 
species employs a type I secretion system which involves an uncleaved C-terminal 
secretion signal on the surface layer protein (RsaA) and several transport proteins 
encoded by genes 3 s to the surface layer protein gene (rsaA) (Amram, P. and Smit, J. 
(1998) Journal of Bacteriology 180:3062-3069). 



10 



A typical type I secretion system uses three transport protein components. 
One such component, the ABC transporter, is embedded in the inner membrane, 
contains an ATP-binding region, recognizes the C-terminal secretion signal of the 
substrate protein, and hydrolyzes ATP during the transport process. Another 

15 component, the membrane fusion protein (MFP) is anchored in the inner membrane 
and appears to span the periplasm. The remaining component is an outer membrane 
protein (OMP) that is thought to interact with the MFP to form a channel that extends 
from the cytoplasm through the two membranes to the outside of the cell. In 
C. crescentus. the ABC transporter and the MFP proteins have been termed RsaD and 

20 RsaE (respectively) and their genes are immediately 3' of rsaA. Further downstream 
is the rsaF gene which is believed to encode the OMP. 

It is desirable to provide for the use of Caulobacter species other than 
C. crescentus in the expression and secretion of heterologous polypeptides from a host 
25 organism. 

Summary of Invention 

This invention is based on the discovery that S-layer producing freshwater 
30 Caulobacter (other than C. crescentus ) rely on a type I secretion signal located at the 
C-terminus of the S-layer protein and highly conserved transport proteins. While the 
secretion signal itself is not as well conserved as the transport proteins, a secretion 
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vary. The protein has been characterized both structurally and chemically. It is 
composed of ring-like structures spaced at 22nm intervals arranged in a hexagonal 
manner on the outer membrane. The S-layer is bound to the bacterial surface and may 
be removed by low pH treatment or by treatment with a calcium chelator such as 
5 EDTA. 

The similarity of S-layer proteins in different strains of Caulobacter permits 
the use of a cloned S-layer protein gene of one Caulobacter strain for retrieval of the 
corresponding gene in other Caulobacter strains (see: Walker; S.G. etaJ. (1992) 
10 (supra); and, MacRae, J.D. et al: (1991) f supra l. 

Expression, secretion and optionally, presentation of a heterologous 
polypeptide in Caulobacter provides advantages not previously seen in systems using 
organisms such as E. coli and Salmonella in which fusion products using different 

15 surface proteins have been reported. All known Caulobacter strains are believed to be 
harmless and are nearly ubiquitous in aquatic environments. In contrast, many 
Salmonella and E. coli strains are pathogens. Consequently, expression and secretion 
of a heterologous polypeptide using Caulobacter as a vehicle will have the advantage 
that the expression system will be stable in a variety of outdoor environments and may 

20 not present problems associated with the use of a pathogenic organism. Furthermore. 
Caulobacter are natural biofilm forming species and may be adapted for use in fixed 
biofilm bioreactors. The quantity of S-layer protein that is synthesized and is secreted 
by Caulobacter is high, reaching 12% of the cell protein. The unique characteristics 
of the repetitive, two-dimensional S-layer would also make such bacteria ideal for use 

2 5 as an expression system, or as a presentation surface for heterologous polypeptides. 

This is desirable in a live vaccine to maximize presentation of the antigen or antigenic 
epitope. In addition, use of such a presentation surface to achieve maximal exposure 
of a desired polypeptide to the environment results in such bacteria being particularly 
suited for use in bioreactors or as carriers for the polypeptide in aqueous or terrestrial 

3 0 outdoor environments. 

The invention described in the PCT application published September 18, 1997 
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can be limiting in many leachate waste streams, especially those with high levels of 
iron or calcium. 



Freshwater f!an1nharr*»r niv%/4.. /.;„/» c i-. . 

'"" - *" — v, "s ^-'-J^ia may oc readily detected by 
5 negative stain transmission electron microscopy techniques. Caulobacter may be 
isolated using the methods outlined by MacRae, J.D. and Smit (1991) Applied and 
Environmental Microbiology 57:751-758, which take advantage of the fact that 
, W1<w ^ lKMh UI sulrvauon wnjJe olher soU ^ waier bacteria ^ 

not and that they all produce a distinctive stalk structure, visible by light microscopy 
10 (using either phase contrast or standard dye staining methods). Once Caulobacter 
strains are isolated in a typical procedure, colonies are suspended in 2% ammonium 
molybdate negative stain and applied to plastic-filmed, carbon-stabilized 300 or 
400 mesh copper or nickel grids and examined in a transmission electron microscope 
at 60 kilovolt accelerating voltage, as described in Smit, J. (1986) "Protein Surface 
15 Layers of Bacteria", in Outer Membrane* as Model Sy ^ M . 

Ed. J. Wiley & Sons, at page 343-376. S-layers are seen a twc^imensional' 
geometric patterns most readiiy on those cells in a colony that have lysed and released 
their internal contents. 



2 O The S-layer of different freshwater Caulobacter is hexagonally arranged with a 

similar centre«entre dimension and antisera raised against, the S-layer protein of 
C crejeentus strain CB15 reacts with S-layer proteins from other Caulobacter 
(see: Walker, S.G., «al. (1992) (J. Bacteriol. 174:1783-1792). All S-layer proteins 
isolated from Caulobacter may be substantially purified using the same extraction 
25 method (pH extraction). All strains appear to have a lipopolysaccharide (LPS) 
reacuve with antisera against the CB15 strain lipopolysaccharide species. The LPS 
appears to be required for S-layer attachment. 

The S-layer elaborated by freshwater isolates of Caulobacter are visibly 

3 0 indistinguishable from the S-layer produced by Caulobacter crescentus strains CB2 

and CB15. The S-layer proteins from the latter strains have approximately 
100,000 m.w. although sizes of S-layer proteins from other species and strains will 
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PRODUCTION OF HETEROLOGOUS 
POLYPEPTIDES FROM FRESHWATER CAULOBACTER 

Field of Invention 

5 

This invention relates to the use of the Caulobacter surface layer protein 
(S-layer protein) transport system for the expression and secretion of heterologous 

polypeptides from a host organism. 

1 0 Background of the Invention 

Many genera of bacteria assemble layers composed of repetitive, regularly 
aligned, proteinaceous sub-units on the outer surface of the cell. These layers are 
essentially two^Jimensional paracrystalline arrays, and being the outer molecular layer 

1 5 of the organism, directly interface with the environment-. Such layers are commonly 

known as S-layers and are found on members of every taxonomic group of walled 
bacteria including: Archaebacteria; Chlamydia : Cvanobacteria : Acinetobacter : 
Bacillus; Aquaspirillum ; Caulobacter; Clostridium : Chrornarhim . Typically, an" 
S-layer will be composed of an intricate, geometric array of at least one major protein 
20 having a repetitive regular stmcture. In many cases, such as in Caulobacter . the 
S-layer protein is synthesized by the cell in large quantities and the S-layer completely 
envelopes the cell and thus appears to be a protective layer. 

Caulobacter are natural inhabitants of most soil and freshwater environments 

2 5 and may persist in waste water treatment systems and effluents. The bacieria alternate 

between a stalked cell that is attached to a surface, and an adhesive motile dispersal 
cell that searches to find a new surface upon which to stick and conven to a stalked 
cell. The bacieria attach tenaciously to nearly ali surfaces and do so without 
producing the extracellular enzymes or polysaccharide "slimes" that are characterisuc 

3 0 of most other surface attached bacteria.. They have simple requirements for growth. 

The organism is ubiquitous in the environment and has been isolated from 
oligotrophy to mesoirophic situations. Caulobacters are known for their ability to 
tolerate low nutrient level stresses, for example, low phosphate levels. This nutrient 
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B ' Creating Fusions of Desired Sequences with C-terminai Portions -Methnri -> 

Methods other than the use of a carrier cassette plasmid are possible for 
creation of heterologous insertions into deletion versions of a S-iayer gene or fusions 
5 with C-terminaJ portions of a S-layer protein. PCR may be used or other known 
methods may be used. The general procedure is as follows: 



(1) Use of PCR to prepare appropriate segments: 



10 



(a) Preparation of amplified segment with appropriate ends may be carried 
out in a manner similar to that described part A(l)(a) of this example. 
Oligonucleotides are designed and synthesized such that they will anneal to 
appropriate regions of the desired heterologous DN A and aJso contain "sticky ends" 
of appropriate sequence and frame so that the resulting PCR product can. be directly 

1 5 inserted into appropriate modified S-layer genes. 

(b) Transfer to appropriate C-terminal segments may be carried out by 
inserting the PCR products into selected C-terminal segments such as AA782-1026, 
AA905-1026, or AA944-1026, as described in Examples 6d-6g. In addition to the 

20 BamHJ site described, the EcoRl restriction site could also be used as the 5' terminus 
of the incoming PCR segment, since this site is also available in the pUC8 vector and 
not in the S-layer gene, so long as the correct reading frame was maintained when 
designing the oligonucleotides used to prepare the PCR product. 

2 5 (2) Expression of the desired fusion in an appropriate Caulobacter host strain may 
be carried out using the procedures outlined in pan A(3) of this example. 

c " Creating Insertions of Desired Sequences into Versions of a S-laver Gene 
Having Large Internal In-frame Deletions. 

30 



The general process may be as follows, with reference to rsaA: 
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U) Creating Appropriate In-frame Deletions 

rsaA (AA95-782) and rsaA( AA 188-782) may be prepared as described 
above. Because most of the BamHl linker insertion sites are in the same reading 
5 frame with respect to each other, it is possible to combine other pairs of 5' and 3' 
segments using the same general method, with the same result of maintenance of 
correct reading frame throughout. These deletion versions may then be tested 
individually to ensure that S-layer protein is still secreted by the Caulobacter . 

10 (2) Insertion of a Gene Segment c arrier Cassette C ontaining the rw^ 
Sciences: insertion and transfer of carrier cassettes may be done using the 
procedures described in pans A(l) and A(2) of this example. 

Example 6h: Insertion of the 109 AA FHNV segment into rsaA ( AA95-782) and 
15 insertion of the 109 AA IHNV segment into rsaA( AA188-782) may be carried out as 
in Examples 7d-7g. Expression of the desired genetic construction in appropriate 
C. crescenrus strains may be done using the procedures outlined in pan A(3) of this 
example. 

20 (3) Alternate PCR Procedures: may be used to prepare a heterologous segment 
for direct insertion into the BamHI site with the deletion versions of the rsaA gene. 
The procedure is essentially the same as described in pan B(l) of this example. 

Example 7: Transfer to a Nat ive S-laver Gene Chromosomal Site as a Single 
25 Crossover Event 

Fusion of a carrier cassette containing heterologous DNA segments to a 
C-terminal S-layer protein segment plasmid results in a plasmid that is not maintained 
in Caulobacter . Selection for the antibiotic marker on the plasmid results in detection 
3 0 of the rescue events. Most commonly these are single crossover homologous 
recombination events. The result is a direct insenion of the entire plasmid into the 
chromosome. Thus the resident copy of the S-layer gene remains unchanged as well 
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as the incoming modified S-Iayer gene. In such cases it may be desirable to. use 
Caulobacter strains in which the resident S-layer gene has been inactivated by 
adapting known procedures. One example is C. crescenrus strain CBlSAKSac which 
has an antibiotic resistance gene cassette introduced at a position in the S-layer gene 
5 about 25 % of the way from the 5.' terminus. 

Example 8: Transf er to a Native S-laver Gene Chromosomal Site as a Dnuhl* 
Crossover Event 

10 In certain cases it may be desirable to completely exchange a resident S-layer 

gene with an incoming modified version. One method is by the incorporation of a 
sacB gene cassette (Hynes, M.F., et_al. (1989) Gene 78: 111-119) into pUC8 based 
plasmids carrying the desired chimeric gene construction. This cassette contains a 
levansucrase gene from Bacillus subtilis that, in the presence of sucrose, is thought to 

1 5 produce a sugar polymer that is toxic to most bacteria. One first selects for a single 
crossover event as described in Example 7. Subsequent growth on sucrose-containing 
medium results in the death of all cells except those that lose the offending sacB gene 
by homologous recombination within adjacent gene copies. Two events are possible; 
restoration of the resident copy of the S-layer gene or replacement of the resident 

2 0 copy with the incoming modified gene. A screen with insertion DNA as probe or 

antibody specific to the heterologous gene product identifies successful gene 
replacement events. The method requires that S-layer gene sequence or native 
sequences immediately adjacent to an S-layer gene be present on both sides of the 
heterologous sequence and is best suited for deletion versions of a S-layer gene. 

25 

Other methods are available for the delivery of genes to the chromosome of 
Caulobacter. Methods involving the use of the transposons Tn5 and Tn7 as a means 
of delivery of genes to random chromosome locations are available (see: Barry, 
G.F. (1988) Gene 71:75-84.). The use of the xylose utilization operon as a target for 

3 0 chromosome insertion have also been described. This method involves the 

incorporation of a portion the operon into a pUC8 based plasmid construction. This 
allows homologous recombination within the xylose operon as a means of plasmid 
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rescue. Loss of the ability to use xylose as a nutrition s_pur« corrfrms .the. rescue 
event. 



Example 9: Transformation and Expression o f Heterologous Pr otein in Caulnh^r 
5 other than C. crescenrus 



Using the procedures described above, a DNA construct made according to 
Examples 4 and 6 was introduced into the freshwater S-layer producing Ca U iob ac ter 
identified as FWC42 in MacRae, J.D. and J. Smit (1991) and in Walker, S.G. et_aj. 
(1992). FWC42 is clearly distinct as a species separate from C. crescenrus . The 
construct contained 3 copies of the pilus epitope and a nucleotide sequence encoding 
amino acids 690-1026 of RsaA as the secretion signal. The heterologous polypeptide 
was expressed by the transformed FWC42 cells and was secreted at sufficient levels 
such that the secreted protein was found in the cell medium as an aggregate. 
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Example 10: Demon stration of Type I Secretion Mechanism and 
Sequence Similarity in Different Caulobacter Species 

The following non-C. crescenrus species of freshwater Caulobacter as 
20 described in MacRae, J.D. and J. Smit (1991) and in Walker, S.G. etal. (1992) were 
employed in this Example: FWC1, FWC8, FWC9, FWC17 and FWC19, FWC28, 
FWC32, FWC39 and FWC42. 

Employing the materials and methods described in Awram, P. and J. Smit 

2 5 (1998) J. of Bacteriology 180:3062-3069, species FWC8, 9, 17, 19, 28, 32, 39 and 

42 were transfected with plasmids containing the ?. aeruginosa alkaline protease gene 
(aprA) which is a known type I secretory protein. The protease was shown to be 
secreted at levels comparable to the levels of such protease reported by Awran and 
Smit for C. crescenrus transformed in the same way. Thus, the transport mechanism 

3 0 in the non-C. crescenrus species are Type I mechanisms capable of recognizing 

diverse Type I (C-terrninal) secretion signals. 
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The following recombinant DNA and DNA sequencing methods are described 
in Awram, P. and J. Smit (1998) and may be used with appropriate adaptation in this 
invention. These procedures may be used in screening suitable Caulobacter for use as 
host organisms and for identification of Caulobacter of this invention. E. coli DH5 a 

5 (Life Technologies) was used for all E. coli cloning manipulations. E. coli was 
grown at 37oC in Luria broth (1% tryptone, 0.5% NaCl, 0.5% yeast extract) with 
.1.2% agar for plates. Caulobacter was grown at 3<X: in PYE medium 
(0,2% peptone, 0.1% yeast extract, 0.01% CaCl 2 , 0.02% MgS0 4 ) with 1.2% agar 
for plates. AmpicUlin was used at 100 ^/ml, streptomycin was used at 50 ^g/ml, 

0 kanamycin was used at SOug/ml, and tetracycline was used at 0.5 ^/ml for 
Caulobacter and at 10 pg/ml for E. cpjj when appropriate. 

Standard methods of DNA manipulation and isolation were used. 
Electroporation of Caulobacter was performed as described above. Southern blot 
5 hybridizations were done in accordance with the membrane manufacturer's manual 
(Amersham Hybond-N). Radiolabeled probes were made by nick translation using 
standard procedures. 

PCR product containing rsaD and rsaE was generated using primers 
C 5 -CGG AATCGCGCT ACGCGCTGG-3 ' (SEQ ID NO:2) and 
5 -GGG AGCTCG AAGGGTCCTG A-3 ' (SEQ ID NO:3). Product was generated 
using Tag polymerase (Bethesda Research Laboratories) and following the 
manufacture's suggested protocols. Following a 5-min denaturation at 95 C, two 
cycles of 1 min at 42°C, 2 min at 65 °C, and 30s at 95 °C were followed by 25 cycles 
5 of 1 minat 55 0 C,2minat65<>C, and 30 s at 95 °C. The vector pBSKS+ Stratagene 
was cut at the £coRV site and T tailed. The PCR product was ligated into this vector 
so that rsaD and rsaE would be in the same orientation as the IacZ promoter of 
pBSKS + . This construct was called pRAT5. 



0 

Plasmid pBBR5 was constructed from plasmids pBBRlMCS (Kovach, M.E 
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et_al. (1994) BioTechniques 16:800-802) and pHP45Q-Tc (Fellay, R. etal. (1987) 
Gene 52:147-154). The q-Tc fragment from P HP45q-Tc was removed by using 
HindW, and the ends were blunted by using T4 polymerase. A 0.3-kbp portion of the 
Cmy-encoding gene was removed from pBBRlMGS b7 cutting with Oral and replaced 
5 with the blunted q-Tc Fragment, producing a Tct broad-host-range vector that 
replicates in Caulobacter . 

Plasmid pRAT4 A H was made by removing the Clal-Hindm fragment from 
pTZ18UB:rsa A P (Bingle, W.H. et_ai. (1997) J. Bacterid. 179:601-611) and replacing 
1 0 it with the Clal-Hindm fragment from pRATl containing the C-terminus of rsaA and 
the complete rsaD and rsaE genes. 

A NA1000 cosmid library (Alley, M.R. etal. (1991) Genetics 129:333-341) 
was probed with radiolabelled rsaA. 1 1 cosmid clones hybridizing' to the probe were 

IS isolated. Southern blot analysis was used to determine which cosmids contained DNA 
3' of rsaA. An 11.7kb Srtl-EcoRl fragment containing rsaA plus 7.3 kb of 3' DNA 
was isolated from one of the cosmids and cloned into the S&l-EcoRl site of 
pBSKS+; the resulting plasmid was named pRATl. The 3' end of the cloned 
fragment consisted of 15 bp of pLAFRS DNA containing Sau3A\, Smal; and £coRI 

2 0 sites. 

BamVH fragments from pRATl were subcloned into the BamKl site of vector 
pTZ18R for sequencing. The 3'-end fragment was subcloned into pTZ18R by using 
BamHl and fcoRI. The 5'^nd fragment was subcloned into pTZ18R by usinfi 

2 5 Hindtn. Sequencing was performed on a DNA sequencer (Applied BiosystemsTw 

model 373). After use of universal primers, additional sequence was obtained by 
"walking along" the DNA using 15-bp primers based on the acquired sequence. 
Nucleotide and amino acid sequence data were analyzed by using GeneworksTu and 
MacVectonM software (Oxford Molecular Group) or the National Center for 

3 0 Biotechnology Information BLAST e-mail server using the BLAST algorithm. 

Protein alignments were generated by using the ClustalW™ algorithm as implemented 
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Sequences for rsaD and rsaE have been assigned GenBank Access™ 
No. Ah062345. The proteins have the following sequences: 

5 

RsaD (SEQ ID NO:4): 
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MFKRSGAKPT I FDQAVLVARPAVI TAM^SFFINIIALVSPLYMI^VYDRVXTST^HVSTLIVLTVICVFI, 
FLVYGI^Al^TQVLVRGGlJ<JTXr^PIFKS^ 

W^PWSimPITeiLMXACXXXIiHAVHNDHXTIOT^ 

frWGCLQARWRARI^EQVAWQAAASDAGGAVMSG X XVFWTVOTLXLGGGAYLAXDGKI SAGAMIAGS ILV 
CRMAP I E GAVGO>f)Q*Y I GARGAWDRLO. TMXAE E KSADDHMPLPE PRGVLSAEAAS I LPPGAQQ PTMRQA 
SnUDAGAAVALVGPSAAGXSSI^CIVGVWPCAAG^ 
VAQNIAJ^FESOEVIEAATIAGVHEMIOSLI^GYDTAIGXC^ 
15 DE PNASI^CJVGZ VAI>1EAMKRIJCAAKRTVI FATHXVNIXAQAEOf IMVTNOGVI SDFGERDRCWPS 

RsaE (SEQ ID NO:5): 



MKPPKIQRPTDNFQAVARXGYG2 XALTFVGlJO<AArAPlCSAVlA>«?VVSAEVSQrWQHIiG<>lLAKIX, 
2 0 VRE GEKVKAGQVUXI^D PTQ AKAAAGX TRNQ YVAlJ<AMEARliAERDQRPS I SFPADLTSQRADPMVARA 
lADEQAQFTEBRQTI^^l^QRl^yQSEIEGIDRQTOGXKDQMF^ 

LLAIXARAGS LSG S I GR1TADRS KAVQGASD TQLiKVRO. X KQE PTE QVSQS I TE TRVRXAEVTE KEWASD 
AOKRJKIVSFVNGTAQNXJUTOGAVVT^PLV^ 

HSAGNPDPERHDPVAVADR2 SDPQKCARl^XGIVRVDVKCjI^PHliRGlWTAGMPAQVTVPTGERrvi^QYl. 

2 5 FSPLRDTTLRTTMREE 

The Table below sets out results of initial sequencing of S-layer related genes 
in 2 strains of C. crescentus (NA1000 and CB2) and four non-C. crescentus species of 
S-layer producing freshwater Caulobacter (FWC6, 8, 19, 27 and 39). Genes 

3 C identified as A, D and E are the S-layer structural gene, the ABC transporter gene and 

the membrane fusion protein (MPF) respectively. In the C. crescentus strains, the 
latter genes are rsaA. rsaD. and rsaE, respectively. The transport protein genes are 
highly conserved within and among the species. Within the C. crescentus species, 
there is high conservation of the rsaA gene, including the C-terminal secretion signal. 
3 5 A region of the S-layer gene in FWC27 outside the secretion signal region shows 
clear divergence from the equivalent region in the two C. crescentus strains. 
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Funher sequencing studies comparing the S-Iayer protein gene (A), the 
ABC-transporter gene (D) and the MFP gene (E) of FWCl, 9. 19, 39 and 42 to the 
rsaA, rsaD and rsaE genes of C. crescentus strain CB15A produced the following 
5 results. FWCl and FWC19 respectively exhibited about 32% and about 27% identity 
over the last 300 C-terminal amino acids of the A gene, as compared to rsaA. For the 
last 100 C-terminal amino acids, identity of FWCl and FWC19 sequences to rsaA 
was about 50% and 41% (respectively), with the most significant identity being in~d* 
last 62 amino acids. Sequencing of various 35 to about 350 amino acid segments of 
10 the D gene from FWCl, FWC9, FWC19. FWC39 and FWC42 resulted in sequence 
identity to rsaD of at least about 79%. Sequencing of large portions of the E gene 
from FWC19 and FWC42 (about 368 and about 290 amino acids respectively) 
demonstrated about 85% and about 73% identity (respectively) to the rsaE gene. 

15 Approximately the last 100 C-terminal amino acids of the A gene for FWCl 

(SEQ ID NO:6) and FWC19 (SEQ ED NO:7) are set out below. 



FWCl 

20 TTDTUCFANTG-TETFTSTKVDLTG^ 

TWFQYGGNTYT\'EDRDAGNTFN T VATDIV\TaTGAV'DLST-AVLSAFGRRS 
SLTLV 



FWC19 

RAHMllJa>TRHVSDRWGRHYARLVQLPGRPCPKl^DAATTGNASHKV 

SWFVY(}GDTYLVKMSTLAPPSKT.\RriVVKLTGTTNDLTK-A 
TLTLG 

This invention now being described, it will be apparent to one of ordinary skill 
in the an that changes and modifications can be made thereto without departing from 
the spirit or scope of the appended claims. 
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1- A hos, ceU for expression and secrerion of a he,er 0 ,ogous po.vpen.ide 

*« - «u „ . comprising at leas , one Mrf j 

" ' h ° S ' ' DN * consouc comprising L 

<ncod,ng a polvpep.ide hettrologom „ , ^ * 

■ ^ Pr0VB ° WhCT "» «" «W --port pro.eins having s ^ 
» - bo* SEQ ID NO, a* SEQ ID N0:5 . *, secret sigra| J „„, ~ 



2. _ T* ceU of ciaim , vtaKin « least one of ^ proteim Qf ^ 
an amino acid sequence the same as SEQ ID N0 . 4 or SEQ ID N0:5 . 

3. TTk ceU of Cairn 2 having a*** proteins with ^ m 
seance as SEQ ID NO:4 ar* SEQ n> N0:J , and wherein the secre.ion signa. does 
not comprise SEQ ID NO: 1. 



15 



JO 4. The cell of claim 1, itjrf-wherein th* nviA - 

u, . .*m^nerein the DNA construct further comprises an 

? operably linked promoter recognized by the cell. 

5. A method for identifying a ^acter suitable for use as a host cell for 
express.on and secretion of a heterologous polypeptide comprising: 

(a) extracting DNA from a candidate Caulobacter; 

(b) contacting the DNA with an oligonucleotide capable of selective 
hybnduanon to a nucleotide sequence encoding SEQ ID NO:4 or SEQ ID NO* and 

3 0 



(c) determining whether the oligonucleotide hybridizes 



to the DNA. 
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6. The method of claim 5 wherein the oligonucleotide is labelled and said 
determining is by detection of the presence of the label bound to the DNA. 

7. The method of ciaim 5 wherein said determining is by amplification of DNA 
5 with the oligonucleotide as a primer, followed by detection of an amplification 

product. 

8. A DNA construct comprising one or more restriction sites for facilitating 
insenion of DNA into the construct, wherein the construct further comprises DNA 

10 encoding a Caulobacter surface layer protein secretion signal not present in 
C. crescentus . 

9. A DNA construct comprising DNA encoding a polypeptide not present in 
Caulobacter surface layer protein 5' from and operatively linked to DNA encoding a 

1 5 Caulobacter surface layer protein secretion signal not present in C. crescentus . 

10. The DNA construct of claim 9 ebte=fcrther comprising an operably linked 
promoter recognized by Caulobacter . 

2 0 11. The DNA construct of claim 8, jb>c±fc-wherein the secretion signal has an 
amino acid sequence which does not comprise SEQ ID NO:l. 

12. A bacterial cell comprising a DNA constnict of claim 9, -Hkej£th 
25 13. The cell of claim 12, wherein the cell is a Caulobacter . 

14. The cell of claim 12, wherein the cell is C. crescentus . 

15. The cell of claim 13 or 14, wherein the DNA construct further comprises an 
30 operably linked promoter recognized by Caulobacter and wherein the DNA construct 

is expressed in the cell arid the protein so expressed is secreted by the cell. 
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16. A secreted protein obtained from a cell of claim 15, wherein the protein 
comprises one or more portions heterologous to a surface layer protein of the cell and 
wherein the protein has a C-terminal portion comprising a surface layer protein 
secretion signal not present in C. crescentus . 

5 
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